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Multivariate Statistical Inference under Marginal Structure^ !• 

by 

- Leon Jay Gleser 
The Johns Hopkins University and Educational Testing Service 

and 

• Ingram Olkin 

Stanford University and Educational Testing Service 

1* Introduction 

In this paper we are concerned with statistical inference concerning 
the parameters of k multivariate normal populations. Several models 
are considered in which the parameters hSve certain hierarchical relation- 
ships. These models may arise in a variety of scientific contexts^ but 
our concern with tbi'^ prob3.em originated in the context of testing the 
hypothesis that k chological tests are parallel forms of the same test. 

Suppose that we are utilizing k different (collections of) psychologi- 
cal tests. These (collections of) tests have one subtest T^ in common^ 
and are designed to be statistically equivalent (parallel) to one another. 

The components of the g -th test can be represented as (T^^T ) ^ where 

^ g 

Tq is the subtest common to all k psychological tests, and T is the 
subtest peculiar to the g -th test, g = 1, 2, ...,k - 

In one possible experimental design^ each of the k psychological 
tests is given to a different"^ group of persons. The k groups of persons 
are randomly constructed (of possibly unequal sizes), and are considered 
to be (statistically) homogeneous with respect to the psychological traits 
being measured. The score of a single person from the g -th group on the 

"H*7ork supported in part by the Educational Testing Service, the Air Force 
O'ffice of Scientific Research (contract Flfi^620-70-C-0066) at Johns Hopkins 
Q University, and the National Science Foundation (Grant GP-52526x) at Stanford 
ERJC University. 
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g -th test (Tq.T^) is denoted by (x^^^^x^^^) , where x^^^ is the score 

on subtest and x^^^^ is the score on the remainder, T . of the 

0 1 g 

test. The scores x^^^^x^^^ may be scalars^ or they may be (rov/) vectors^ 

depending on whether the subtests • T themselves are considered to 

0 g 

consist of one^ or of more than one^ parts. However^ the dimensions of 
Xq"^'^Xq^\ . . and Xq^^ are the same (since they are scores on the common 
subtest ), and the dimensions of and are the 

same (since the subtests ^]_^^2^ * • "^^j^ designed to be statistically 

equivalent to one another). To be specific, let us assume that T^ con- 
sists of q parts, so that the common dimensions of x^"^\x^^^, . . -^x^^^ 
are 1 x q , and let us assume that T.,T^, .•.,T, each consist of p - q 
parts (q < p) , so that the common dimensions of x^^xj^^^ . . .,x^^^ 
are 1 x (p - q) . 

It is assumed that the score of an individual on any test has a 
multivariate noimal distribution, and that scores of individuals are 
mutually statistically independent* Thus, in describing a statistical 
model for this problem, it remains only to specify the mean vectors and 

covariance matrices of the score vectors (:)c^"^^,xp^),(x^^^,x^^^), . . ., 
(k) (k) 

^,x£ ^) . These parameters are perhaps best described by a table: 



ERIC 



Parameters 


Group (Test) 


1 


2 


• • • 


k 


iMean Vector 










• mm 




Covariance 
Matrix 


^00 ^01 

2(1) 2(1)] 

\^io hil 






^00 ^01 

I ho hi / 




m m m 




1^00 ^01 

i^o hil 





(g) 

For the g -th group (the individuals who take the g -th test); 

is the expected score (vector) on the parts of -the subtest ^ [x^^ is 

(g) 

the expected score (vector) on the parts of subtest T ^ 2^^^ is the co- 

^ (g) . 

variance matrix among the scores on the parts of subtest ^ 2^^^ is 
covariance matrix among the scores on the parts of subtest T ^ and 
^01^ = (^Iq^)' is the matrix of covariances between scores on parts of 
and scores on parts of T .We write 

0 ^ cr 



/.(g) jg)\ 

^00 ^ 01 



y(g) v(g) 

^10 ^11 



g = 1,2, ... ,k 



I 



,(g) 



■ (g) 



ThuS; [i^^^ is the mean vector and 2^ ' is the covariance matrix of 
the distribution of the score of a single person from the g -th group on 
the g -th test (T^-T ) . 

If the k groups are truly (statistically) homogeneous with respect 
to the psychological attributes being measured^ then^ since all k groups 
take subtest ^ we would expect 



(1.1) 



. Jl) . J2) _ (k) 



■00 



00 



00 



to be true, regardless of whether or not the k tests (^q, T^), (T^, Tg), 
•••,(Tq,Tj^) are parallel forms (statistically eauivalent). The hypothesis 
that all k tests are parallel forms is 



To verify that the k tests (T^.T^), (T^.TgL • . (T^.Tj^) are indeed 

parallel forms ^ given that the k groups are statistically homogeneous, 

we can test the hypothesis H versus the more general alternative 

mvc 

hypothesis H . 

m vc' 

In some instances we may believe that the noncommon parts T^^T^^ . . .^T^ 
of the k tests are not necessarily statistically equivalent, and we may 
have some doubts as to whether or not the k groups ha v^ltg'^Se ^aaft ^^^" 
performance on the common subtest (i-e., v/hether ^10*^^ =- = ... = 

\1q However, V7e may continue to believe that the parts o"* subtest 

have the same interrelationships (variances and covariances) in all 
k groups. In such a case, our given hypothesis is 

(1.5) H • z^^) - z(2) - - t(^^ 

^-^'^^ vc» • ^00 ■* ^00 - = ^00 ' 

and we may want to test H , , or H against this hypothesis. (Note 

m vc mvc 

that H^.^^ implies H^t^^i which in turn implies H^^, .) Acceptance of 

the hypothesis H^,^^, as against the hypothesis H^^, means that all k 

groups respond similarly to subtest --in other words, the k groups 

are marginally homogeneous in their response to subtest . Acceptance 

of the hypothesis H^,_ as against H , means that the k tests are paral- 

m\rc vc 

lei forms and that the k groups are homogeneous in their responses to 

the tests (Tq.T^). (T^.Tg ),..., (T^,Tj^) . 

Besides hypotheses H , H , , , H , , various intermediate 

mvc ^ m^vc^ ' vc* ^ 

hypotheses may oe of interest. For example, the hypotheses: 



(1» H,,: z(l) = z(2)= ... z(^> 



and 



(1.5) H , = = 



= K 



(k) 



,(1) _ .(2) 



(k) 



may be of concern. Figure 1 indicates the logical relationships among the 



hypotheses (models )H ^*H,,;H,^K , and H • 

mvc ^ m'vc' ^ vc' vc ^ m'vc 



^ m'vc vc^ 



mvc_ ^ ^ vc' 



Figure 1* Logical relationships among the hypotheses. An 

arrow indicates implication. Thus, H H , 

^ ^ mvc m vc 

means that hypothesis H implies hypothesis 
m'vc * 

The maximum likelihood estimators (MLE) of the parameters under the 
various models (the models defined by hypotheses ^ ^m'vc' * \c' ^ 

H , , and H ) are listed in Section 3 and derived in an Appendix at 
m'vc ' vc 

the end of this paper* Using the results given in the Appendix^ we may 
obtain likelihood ratio tests (LRT) between various pairs of hypotheses. 
These tests are given in Section In general^ the test statistics ob- 
tained from the likelihood ratio approach hav^ distributions similar in 
form to the distributions for the LRT for the multivariate analysis of 
variance and'to the distributions for Wilks's lambda test for the equality 
of covariance matrices [see Anderson (1958)], The exact distribution^are 
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those of products of powers of independent beta variates^ and are known to 
be very complicated in fonn* .However^ by using the Box (19^9) approxiiration^ 
we may obtain approximate levels of significance for these tests from a 
weighted sum of chi-square distributions (see Section k)* A numerical 
illustration of the computation and use of one of these likelihood ratio 
tests in a practical context appears in Section 5; this practical example 
is also used (and intr.>duced) in Section 5 to illustrate the differences 
in value of the MLE under the various models discussed in this paper. 
Before entering into a discussion of the various estimators and tests of 
hypotheses^ however^ we have a few further coinments to make concerning 
the underlying structure of the inferential problem. 

^ 2> The Underlying Structore 

Recall that we have k groups of individuals^ and that each indivi- 
dual in the g -th g::oup takes the test (Tp.^T ) ^ g = 1^2^**.^k . The 

g 

score for the i -th individual in group g is (^oi^^'^li^^ ' "^"^ there 

are N individuals in group g ^ then we need only consider the sample 
g 

means : 

N Ng 

^^•■^^ ""O - N . , ""Oi ^ \ - ^ . , ""li > 
g 1=1 g 1=1 

and can summarize these means by x^ = {^q ^'^j^ ) • 

If the subtests Tq and T each have only one part (i.e.^ each 

of T^ and T are summarized by one score), then x^ and x;'^'' are 
0 g '01 

scalars# However^ we need not be restrictive about this^ since the theory 
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applies even when Tq and each have several parts (and are sum- 

marized by several scores)^ and y.^^^ and x|^^ are vectors. For 
example^ suppose there are two groups^ and the tests given to the groups 
are trade up as follows: 

Essay Questions Multiple Choice Questions 



Tests given to Group 1: 












I 


^1 


T 


^1 


Tests given to Group 2: 












T 

0 


^2 


T 
^0 


^2 



That is^ some of the essay questions and some of the multiple choice ques- 
tions are taken by both groups^ while other essay questions and other 
multiple choice questions differ between the two groups. Suppose that 
the common essay questions are summarized by a single score^ the distinct 
essay questions are summarized by a single score^ the common multiple 
choice questions are summarized by a single score^ and the distinct multi- 
ple choice questions are summarized by a single score. In this case^ 



-d) 



and 



(g) 



are (row) vectors with 2 components (i.e.^ x^^ is 



1x2^ g = 1^2 )^ and x^"^^ and x^^^ are (row) vectors with 2 
components. 

In the example provided in Sections 5 and 5^ there are three groups^ 
and we have the following foimat: 



SAT Verbal SAT Math SAT Verbal 

Operational Operational Experimental 



Test given to Group 1: 










^0 ' 


f 


^0^ 


I 


/ 


Test given to Group 2; 










— ^ 


\ 


^0' 








> 




> 






Test given to Group 5*. 


I 








/ 


T 




T 

0 




^5 



InXthjs^case, T^ has 2 parts (and is summarized by 2 scores), and T^^ , 

Tg , and T^ have 1 pan each. Hence x^-'-^ x^^^ , ; ^q^^ are all 1x2 

.(o\ -(-\\ 

vectors, while x]^ , x^''' , yi^ ' are scalars. - 

Returning to our dr.ta, we can define satrrple cross-product matrices 



(2,2) V 



(g) 



00 01 

v(s) v(g) 
10 11 



g = 1,2, .../K , 



where 



(3.3) - v<g'»"l^(.(f -J(«)).(.(f)-J(«)) 



is the sample cross-product matrix for the vector of scores of 

group g on subtest T^ ^ where 



(2,1.) vff)/l (x^f -x(^))'(x(f) -4^)) 
^ i=l 



is the sample cross-product matrix for the vector x(^^ of scores of 
group g on subtest T ^ and where 

N 

(2.5) V^^^ = ! fx^^) - x(^^)'fx^^^ - x^^^) V^^^ - Of^^h' . 

V^o; . ^ ^Xq^ X ; ^x x^ ; , v^^ j , 

1=1 



is the sample matrix of cross-products for group g between the scores on 

subtest and the scores on subtest T . In the context of our first 

g 

example, V^"^^ and V are both k x k matrices, and V^J^ , V^^^ , 

(l) (2) (1) (2) 
^01 ' ^01 ' ^11 ' ^11 ^ ^ ^ matrices. In the context 

of our second example, V^"^^ , V^^^ , and V^^^ are 5x5 matrices, 
^00^ ^ ^00^. ^ ^00^ 2x2 matrices, V , V^^) ^ and V^^) are 

2x1 matrices, and V^^^ , V , V are 1x1 matrices (i.e., 
scalars). 

When all of the score vectors (xq^^,x|^^) have multivariate noiinal 
distributions, and the performances of individuals on the tests arc- 
mutually statistically independent, then it is well known that the mean 
vectors x^"^^,x^2^, • . .,x^^^ and sample cross-product matrices V^"^\v^^\. 
V ^ together are jointly sufficient for inferences concerning the param- 
eters \i^^\[x^^\...,ix^^^^ , L^^\z^^\...,L^^^ of the model (see 
Anderson (I958)). Let us assume that we have already reduced our test 
score data to a summarization in terms of the quantities 
;W and v(l),v(2),...,vW . 



-10- 

It is known [Anderson (1958)1 that under the assumptions described 
above^ x^^^ and V^^^ are statistically independent of one another for 
g-1,2,...,^. Also (^(^V1)), r/^Kv^^"^ ),..., (^Hv^^)) are 
mutually statistically independent. Tte distribution of x^^^ is multi- 
variaue normal with density function 

(2.6) p(i^ = c(^) |e(s) n e^-f - ,(S))(Z(^))-I(x(g) - ^^^h 

(a:) 

whare c ^ is a certain constant depending upon p and N • The dis- 

tribution of V ' is Wishart with parameters n = W - 1 and E • 

(r) 

The density function of V^^' is given by 

(2.7) p(v(^^) = d(e)|E(e)r"e^^v(s)|^"g'^"^^/^ v(s)(2(g))-l] 

(k) 

where d*°' is a constant depending upon n and p . 

S 

Let 

i = (5Wj(^\. , v-(v(^',v(^',...,vW) , 

(2.8) 

To obtain rraximuin likelihood estimators (MLE) of \i and Z under the 
various models described in Section 1, we need to mxiraize the likelihood 

(2.9) p(x,v) = n Cp(x(s))p(v(s))J , 

g=l 



> 
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with respect to p. and Z under the r^^.-'tr' -^ions upon these parameters 

imposed by the hypotheses H « H , h , , H , . and H 

"^■^ mvc ^ m*vc ' vc* ^ m'vc ^ vc 

For simplicity of exposition^ we summarize t;he MLE's for \x and E 
under the various hypotheses in Section 5; proofs of the results are 
deferred to the Appendix. 

5- Maxiirum Likelihood Estimation 



In this section we smrmarize the maximum likelihood estimators (MLE) 

of the parameters (^i,2) for each of the five models ( H , , H . , , 

~ ~ ^ vc^ m*vc* ^ 

\z ^ \*vc ^ \yc ^ ^^s^^i^^^ Section 1. However, it is helpful 

to first consider a reparameterization wnich simplifies the analysis and 

heli^s to clarify our understanding of the results • 

In three of the five models described above (namely, in H^^, , 

H , , , and H , ), the restrictions on the parameters that are imposed 
mvc *^ mvc ^ 

by the model concern the parameters oT the marginal distributions of the 

scores x^?^ made by individuals on subtest . This, in the models 

defined by H and H , , , the marginal covariance matrices 
vc m vc 

^OO^^^OO^^ '-'^OO^ are constrained to be equal, while in the models de- 
fined by H^,^^ and H^,^^, , the marginal expected score vectors 
^O^^^^o"^^^ '-^^O^^ assumed to be equal- To isolate the marginal 

parameters \x^^^ and E^J^ , g := 1,2, •••,k , we are led [Lord (10^5), 
Anderson (1957)^ Bhargava (196^)] to consider breaking the likelihood 
(2,9) into two factors: (i) the density function of the marginal quan- 
tities 5^^"^^,x^^^, ..•,x^^^ , V^J^,V^^^, ...,V^J^ , and (ii) the conditional 
density function of the sufficient statistic (x,v) given these marginal 
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quantities. If we do this [ see -Equations (A. l) and (A. 2) of the Appendix], 
we find that 



(5.1) 



g = 1,2, ...,k , appear as natural parameters in this representation. Note 
that p^^^ is the q x p - q matrix of j:-egression coefficients (slopes) 
and a^^^ is the 1 x p - q vector of intercepts for the regressions of 
the elements of x^^^ on x^^) (that is, ^{yS^^^.^^h a^^'^ ^ ^^^h^'''^ ) • 
Further^ is the residual covariance matrix of x^^^ after the 

dependence of x[^^ on x^^^ has been removed by regression. Thus, 
the parameters in (5.l) are not only of interest in connection with 
finding the MLE of (ti^Z) ^ but are also of interest in their own right. 
It is nc" 6-^fficu3t 'oo show that (n,2) and (5-l) are equivalent 



(g) (g) 

parameterizations. Equation (5*1) represents ^ ^ ' ^ 

^ and Z^f , g = 1 
(ji^Z) . On the other hand, 



(s) 



p^^^ , and Z^I^Q , g = 1,2, ...,k , as functions of the parameters 



(5.2) 



,(g) - 



-(g) _ 



/(g) v(g)^ 

^00 ^01 

v(g) v(g) 
^10 ^11 



(^(g)^«(g) , ,(g)p(g)) , 

I r(g) iS^'^'fJ^h 
^00 00 ' 



\ 



I 
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(g) (a) ^(g) 

g = 1,2, ...,k , represents (^,1.) as functions of , O.^"^' , Sqo 

P^^^ , and Z^f^o , g = 1,2,... ,k . 

Corresponding to^-the parameters a^^^ , p^^^ , and ^^.^^ > "^ay 
define the sample quantities : 

_ -(g) _ i(g)(v(s))-V^) , b(s)= (vi^^)"Mf^ , 

"1 0 ^00 ' ^01 ' ^00 ' ^01 ^ 

(5-5) 

^ \l.O = ^11 " \o %0 ^ %1 ' 



4 



for gJ=* ^rrA • • , k . Note [ sC^he Appendix] that x^^^ ^ a 



r 

(\)'^"^'^00^-^ ^^^^ ' ^\^''^ll-0 respect: ve MLE of vS^^ , 

^(g) ^ 2^g)^^ ^p(g) ^ and zj?^ , g = 1,2, ...,k , vmen the parameters 

\J\J . ^ ±± V 

(n^^^Z^^^) of^e distribution- of the scores for the individuals in any 

. / (k) ^(k). . 

one group are functionally unrelated to the parameters (ji ■ ; oi 

the distribution of the scores of the individuals in any other group, 
' g / h . In \jn follows, we refer to these maximuin likelihood estimators 
^ '.as the "usual" (unrestricted)westimators of the corresponding parameters. 



For example, we refer to B^^^ as the "usual" estimator of p^^^ 



X 



g = 1,2, ...,k . 

We are now in f^ition to give explicitly the MLE of the parameters 



/ 



and 
vc 



(ji,Z) uader each of the models H^^, , %y*ve* ^^^v^ ^ 

H described in Section 1* Calculatioj f5of these MLE v/ill be illustrated 
mvc ^ 

by tl^e follov;ing practical example. 
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3*0 An Illustrative Example 

The Scholastic Aptitude Test (SAT) of the College Entrance Examination 
Board contains items designed to measure verbal ability and items designed 
to measure mathematical ability. The test is given to a number of indi- 
viduals at a time; different individuals who take the test may receive 
different forms of the In each such form^ certain verbal items 

and certain mathematical irems are common to all forms of the test. 
Other items^ however^ differ from form to form. The common items are 
used for the operational (measurement) purposes of the SAT; the differing 
items are included for certain experimental purposes. Suppose k such 
forms of the SAT exist. Then at a given administration of the test^ each 
form is given to the same number of individuals^ and forms are assigned 
to individuals by a process similar to the technique of (randomized) 
systematic sampling used in sample survey designs. 

The score on the g -th form of the SAT -can be suinmarized by three 
numbers (scores): (i) the total score on those verbal items common to all 
forms (SAT Verbal Operational Score)^ (ii) the total score on those mathe- 
matical items common to all forms (SAT Mathematical Operational Score )^ 
and (iii) the total score on those ite^s peculiar to the g -th form 
(sat Experimental Score). The common parts (SAT Verbal Operational^ 
SAT Mathematical Operational) of each form constitute subtest T^ in the 

terminology of Sections 1 and 2. Thus, the score vector of the 

Oi 

i -th individual in the group of individuals taking the g -th form is 
a 1x2 dimensional vector. The unique part (SAT Experimental) on the 
g -th form constitutes subtest T ^ g = 1^2^...^k . The score vector 
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U) 

^li ^ individual who takes the g -th form of the SAT is thus 

a scalar (a 1x1 dimensional vector). 

In Aprils 1971^ several thousand individuals took the SAT at testing 
centers across the country. A sample of 100 individuals was chosen from 
among all those individuals who took a given form of the SAT^ for each of 
5 different forms (T^^T^) , (^o^^2^ ^ ^^0^^5^ which the experi- 

mental items were comparable (in the present case^ all experimental items 
were verbal items). Thus^ q = 2^ P = 5^ k=5^ and = = = 

100 . The test data have been summarized in terms of the sample mean 

-(g) (r) 
vectors x^ ^ and sample cross-product matrices , separately for 

each form (group)^ g = 1^2^5 • These summarizations appear in Table 1. 

Table 2 gives the "usual" estimators of the parameters (ji^S) and of the 

parameters defined in (5-l)- The values of these "usual" est\mators serve 

to provide comparisons to the values of the MLE of the parameters under 

each of the models H,^H.,^H ^H, ,H , which we 

vc' m'vc' vc m vc ^ mvc 

discuss below. 



5-1 MaximuTA Likelihood Estimators under H . 
vc' 



We begin by considering the most general of the five models described 

in Section 1. In this model, which is defined by the hypothesis H . , 

vc' ^ 

(cr) 

the score vectors x^?^ of individuals in all k groups have a common 

covariance matrix Z^^ . That is, under this model, the parameters (ji,Z) 

of the distribution of the scores are restricted to belong to the parametric 

sub space o) , , where 
vc* ^ 
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Table 1. Summarization of Test Data for the Illustrative Example 





^2 


= = 100 

5 


, q = 2 , 


-7 

P = 5 , 




x^^^ = (55.86, 22.52, 


11^.77) , 




x^^^ - (55.62, 25A5, 


11^.55) , 




x^^^ = (56.05, 2k.H0, 








fiko, 16k 

91,0114- 
[ 59,581 


91,0114- 
65,651^ 
59.01I4 


59,58l\ 

59,0114 

27,525/ 




■1 


^151^,980 
96, 596 
\ 58,11^1 


9^,396 

11,919 
14-2,14-11 


58, 1I41' 
I42, 1411 
26,6llj 


v(5) 


■1 


to, 751 
lOh, 106 

[ 71,115 


10I4-, 106 
75,206 
1^6,765 


11, 115\ 
^^6,765 
32,131 j 
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Tables 2. The ''Usual" Estimators of the Parameters (|i^E) for the 

Illustrative Example 



id) 

"00 

i(2) 
"00 



?(5) 
^00 





= x(^) = (55.86, 


22.52, 111. 77) 


) 




- 5(2) . (55.62, 


25.i^5, 111. 55) 


) 






2l|.l|0, 16.21) 


) 


£(1) 


100 ^ - 1 


910. Ill 

595.81 


910. Ill 

656. 51I 
590. Ill 


595. 8l\ 
590. Ill 
275.25/ 


£(2) 


= J-_ v(2) = 1 
100 1 


^15^19.80 
965.96 
581. 111. 


965.96 
779.19 

l|2l|.ll 


581. Ill \ 
l|2l|.lll 
266.71/ 


£(5) 


100 1 


(1607.51 

lO'+l.Oo 

\ 711.15 


IOI1I.O6 
752.06 
1167.65 


711.15* 

I167.65 

^27.57/ 



-(1) 

'^0 
*^0 



0 

/iiioi.6ii 
\ 910. Ill 

/151I9.80 
\ 965.96 



(55.86, 22.52) , 
(55.62, 25.145) , 
(56.05, 21I.1IO) , 



(1) _ 



-^(2) 



(5) 



0.5502 , 
-0.0705 , 
0.1528 , 



910 
656 



•^^\ g(l) 



0.5928\ 
0.0I197' ' 



4i?o = ^9.82 



965.96' 
779.19, 



^(2) _ .'0.5608\ 



/16OT.51 10iil.06\ ^(5) 
\10iil.06 752.06/ ' ^ 



;0.0980/ ' 4l-0 - ^^'^^ 



i0.5650^ £(5) _ 
\0.1226,' ' .^11-0 - ^^'^ 



vc 



-18- 

= f(!i^5)= = 4V = ^ 4o^ = ^00 ' ^00 arbitrary 

X q covariance matrix) 

Because the marginal covariance matrices ^qO^^^OO^^ * ' '^^00^ have a 
common value 2^^ ^ the pooled estimator 

g=l 

k 

for Z^^ , where N = E N , has intuitive appeal. Note that although 
00 n g 

g=-'l ^ 

the hypothesis H^^, puts no explicit restrictions relating the parameters 
Z^l^ and 2^1^ to 2^^|^^ and , respectively^ g ^ ^ > implicit 

restrictions upon the relationships between these parameters are imposed 

(b)\ 



by H , since 
vc* 



^00 ^01 



v(g) y(g) 



must be a positive semi -definite matrix for all g = 1^2^...^k . On the 

other hand, H , imposes no restrictions (explicit or implicit) 
^ vc 

relating the parameters p^^^ and ^[^Iq to p^^^ and 2^^^^ , g ^ h • 

This fact suggests that p^^^ and ^[^Iq be estimated by their "usual" 

estimators B^^^ and (N )''Vnf^n > respectively^" g = 1^2^. ..^k • 

^ g' 11*0 

Similarly^ since H^^, imposes no restrictions relating the para.aeters 



\x^^^ and a^^^ for one group to the corresponding parameters [x^^^ 
and a^^^ in any other group^ g / ^ ^ we think of estimating these 
parameters by their "usual" estimators ( \x^^^ by x^^^ , and a^^^ by 
a^^^ , g = 1,2,. ..,k ). 
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It is shown in the Appendix that these estimators, namely 



11.0^ N "ll.O ^ 



are Indeed the MLE of n^^^ , a^^^ , E^^ , p^^^ , and Z^^J^ 

respectively, g = 1,2, ...,k , when the model defined by H^^, holds. 

The MLE of (ji,Z) under H , can now be found by substituting (5.5) 
#v ^ vc 

into (5»2), This substitution results in the following MLE: 



\ 



^10^\^0^ %0 r\l'0 ^10 ^^00 ^no^^oo ^1 



for g = 1,2, ...,k . The actual values of the MLE (5.5) and (5.6) for the 
example described in Su'osection 5.0 are summarized in Table 5. 

Note that jl^^^vc') is equal to the "usual" estimator yS^^ for 
H^^^ , but that E^^^vc' ) differs from the "usual" estimator 
for E^^^ . Indeed, the difference between E^^^vc') and (N )""4^^^ 
equals 
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Table 5- Maximum Likelihood Estimators under H , for the 

vc 

Illustrative Example 
^^^\vc') = (55.86, 22.52, lk.77) , 

(T^^^vc') = (55.62, 25 A5, 11^.55) , 
n(5) 



n^^''(vc') = (56.0,7, 2h.hO, 16.21) , 



/1I152.98 
E^-^''(vc') = 971-72 

\ 619.014- 



II152.98 
971.72 

\ 619. 



971-72 
722.60 
■H7-5i 



971.72 
7^'2.6o 
•^£l.56 



6i^':i4- 
hv. .J. 
28"-'.. 714-/ 



ol9.59\ 
.121.56 



-(5) /ll^52.9S 
\ 614-6.55 

(55.86, 22.52) 

(55.62, 25. '45) 

(56.05, 2l4.1tO) 



ii^'^vc') 
.f\vc.) 
^P^(vc') 



971-72 6146.551 
722.60 14-14-1.52 , 
I4I41.52 500.69/ 



ol^^\vq' ) = 0.5502 , 
a^^^(vc'} = -0.0705 , 



a^^)-V( ' ) = 0.1528 , 



^00^^'' ' - \ 910. 1I4 6:>6.5i4j > 

/,,,,\ /0.5928\ -(2), ,v /b.56o8\ -(5)/ ,x /o.565o\ 
) = \0.0i497j ^ ^^"^ = lo.098oj ^ i5^^^(vc') = (o.^gj 

4i!o = ^9.82, Eg)Q(vc') = 15-1^0 , ^^^(vc'; = 11.90 . 



T 



1 
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*^10 00 



^ N , ^0 N "OO-* 
s-1 



\ 



^10 00 



/ 



where I is the q x q identity matrix. 

q 



5.2 Maximum Likelihood Estimators under H^,^^, 



(g) 

In the model defined by the hypothesis H^,^^, , the scores x^^ of 
individuals on subtest are assumed to be identically distributed 

according to a q -variate nomal distribution with mean vector and 
covariance matrix I,^ . That is, under the hypothesis H^,^^, , the 
parameters are restricted to belong to the parametric subspace 

0) , , , where 
m vc* ' 



03 



*vc' " ^0*^^ = . . . = \x^^ = is an arbitrary 1 x q vector, 

Z^^'^ = ... = 2qq^ = is an arbitrary q x q covariance 



matrix} 



As is the case under H , , the hypothesis H , , explicitly 

vc m vc 

requires ^^q^^^OO^^ * * *^^00^ ^^^^ common value Z^^ , and 
implicitly relates the parameters Z^^^ and Z^^|^ of the distribution 

(k) (k) 

of scores for one group to the corresponding parameters Z^^^ and Z^^^ 

of the distribution of scores for any other group, g ji^ h . However, as 

(g) 

before, H . , places no restrictions relating the parameters 3^ ' 

and group to the corresponding parameters 3^^^ and ^^]q 
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of any other group; this suggests that we estimate p^^^ and ^||)q 
their "usual" estimators B^^^ and (^g)"'^ji!o ^ respectively, 
g = 1, 2,*««,k • 

Because H , , requires that the ir.arginal expected score vectors 
m*vc* 

\x^'^\\x^\ i * * ,\x^^ must have a common value , we can estimate by 
the pooled estimator: 



(5.7) ^0 ^ I \ Vo'^ 

g=l 



The residual cross-product matrix 



(5.8) Aoo= E N (x(s) - ^^).(x(s) -^^) 
g=l 

can then be cotibined with W^^ = E V^q to provide an estimator 

g=l 

g -A. 

for Z^^ • J'inally, since H , , does not restrict the relationships 
00 ^ m*vc* 

among a^*^^,a^^^, . . .,a^^^ , we are led to estimation of these parameters by 
their "usual" estimators a^*^^,a^^^, . . a^^^ , respectively. 
In the Appendix we verify that the estimators 



il^^^m'vC) = x^ , a^S^m^vcO = a^^^ , 
(5.9) W-^-^)=|(^00 ^ V ^ P^^^(inWc^) = B(^) 

^ll-O^"" ^ ~ N ^11-0 

g 
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are respectively the MLE of , OC 



T. S^^^ and 



g =^ 1^2, .•,,k , under H^,^^, • The MLE of the parameters (kt,2) can 
now be obtained by substituting (5*9) in (5.2). This substitution results 
in the following MLE: 



>(g) 



(5.10) 



^ 00 '^00''^ 00 • Oi 



'10 ^^00 ^ 00 '^00'' N 11.0 10 ^^00 ^ ^ 00 Vj-*^ 00 Dl 



for g = 1,2, ...,k . The values of the MLE (5.9) and (5.IO) for the 
example described in Subsection Jt^are summarized in T^ble k. 



5 '5 Maximum Likelihood Estimators under H 



vc 



In the model defined by the hypothesis H , scores by all individuals 

vc 

(in any group) are assumed to hn.ve a common covariance matrix 2 , but 

not necessarily equal expected score vectors. That is, under this model, 

the parameters {\i^,Z) of the distributions of the scores are restricted 

to belong to the parametric subspace cd , v^here 

vc ' 

%c = ^^^A^'' ^^^^ 2^^^ = ... = Z^^^ = 2 , 2 an arbitrary 
p X p covariance matrix} 

The MLE of the parameters (^^,z) under this model are well known 
[Anderson (1958), p. 2^+8] : 
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Table k. Maximum Likelihood Estimators under H , , for the 

m'vc' 

Illustrative Example 



i2^-'-\m'vc') = (5'+.51, 2U.12, 15.10) , 



i^^^^m'vc') = (5»+.51, 2U.12, ih.Sh) , 



^^^^(m'vc') = (5»+.51, 2J|.12, 15.61) , 



E^^^m'vc') = 1 


(11152.98 ' 
971.72 
619-05 


971.72 
722.60 
Jil7-62 


619.05 
Jil7.62 f 
285.7*+ ■ 


y 


Z^^^m'vc') = 


f 11152.98 
971.72 
^ 619.1^0 


971-72 
722. 60 

U21.56 


619. Uo \ 
J|21.56 
280. lli-/ 




£^^\tn'vc') = 


/ 11152.98 
971.72 


971-72 
722.60 

UU1.52 


6i^6^5\ 

UU1.52 

500.69/ 


1 

> 



dpCw'vc') (5U.5I, 2U.12) , 



''(m'vc') = 0.5502 , ''(m'vc') = -O.O705 , a^^^(-m'vc ' ) = 0.1528 , 

a(l)/ . .^ /0.5928\ -(2), , /'o.56o8\ -(5), , /o.565o\ 

^(m'vc') = [Q,ok^^] , ^(m'vc') = , ^(m'vc') = ^0^^226/ ' 

£[j;jQ(m'vc') = 19.82 , £^^^Q(m'vc') = 15. UO , yS^^^^{Wvc') = II.90 . 
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where 
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s=l 



= Z V 
^10 \il s=i 



(g) 



Here, we did not need to use the equivalent parameterization (5.l) in 

order to obtain MLE for (^l,E) , since the results are directly and easily 

obtainable. However, for the sake of comparison to the results given in 

previous subsections, we can obtain the MLE of ul^^ , a^^^ , Z^^^ = Z 

^0 ^ ^ 00 00 

p(^) E p , -= Z^^.^ , Where 



Z = 



"00 



^00^ 



^'^0 \i.o - P'^ooP 



\ 



/ 



by substituting (5-11) intc (5-1) • The result of this substitution is the 
following list of MLE: 

(5.12) £oo(^^) = |^00 ' P(--)=^oJ^01 ' 

hiJ""'^ = I (\i - Vo>oi) • 



Ths values of the MLE (5-11) and (5-12) for the example described in 
Subsection 5.0 are suimarizel in T-?ble 5. 




Table 5* Maximum Likelihood Estimators under K for the 

vc 

Illustrative Example 



^^■'■^vc) = (55.86, 22.52, 111. 77) , 
il^^^vc) = (55.62, 25.ii5, , 
(x^^\vc) = (56.05, 2k.k0, 16.21) , 



/IIL52.98 971.72 629. 1|6 

Z(vc) = 971.72 722.60 I127.5O 

\ 629. ii6 ii27.50 289.11/ 

f^^^^vc) = (55.86, 22.52) , a^^^vc) = 0.1122 , 

^0^^'(vc) = (55.62, 25.I15) , a^^\vc) = -0.2712 , 

ap)(vc) = (56.05, 2I1.I1O) , a(5)(vc) = 0.5675 , 

2oo'-) = i%t ' ' ki.oM = 15.87 



er|c 



4 
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Maximum Likelihood Estimators under H^»^^ 

7 

In the model defined by lipothesis H^i^^ > the scores of all 
individuals on subtest have^the same marginal distribution (with 

common mean vector and commoh^ovariance matrix Sqq )• Further^ the 

scores of all individuals on the remainder of Ijhe test have a common co- 
variance matrix E^, y but not nece.^sarily a common mean vector (that is^ 
^Jt£■'■\^l^^^^ . . are not necessarily equal). Finally^ under H^»^^^ 

the score x^^^ of any individual on the common subtest serves as 

an equally good predictor of the score x^?^ of that individual on the 



remainder T of the test^ regardless of the group to which the individual 

^belongs. (That is^^^the correlations .between elements of x^^^ and 

element^, of x^?^ are the same for all individuals i in all groups g .) 

r -^Thus^ under H , the parameters (kt^Z) of the distributions of the 

m vc M *w 

scores of individuals on the various tests are restricted to belong to the 
parametric subspace ^^i^^ y where 



03 

m' vc 



= {(^i^S): = v-^^ = ... = = \Xq is an arbitrary 

1 X q vecuor^ 

E^*^^ = E^^^ = = E^^^ f I is an arbitrary p x p 

covariance matrix) 

The MLE of the parameters (|i^E) can be obtained as special cases of 
results obtained in a previous paper [Gleser and Olkin (1966)]^ or by direct 
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analysis, as in the Appendix to the present paper. Note that the hypothesis 

H implies the folio wlng'^^elat^ among the parameters defined 

m*vc 

in (5.1): 

^00 "•^00 - • • • - ^00 ~ ^00 ' 



(5.15) 



yd) _ y(2) _ _ y(k) _ ^ 



The hypothesis H^,^^ imposes no restrictions concerning relationships 
between (:^^\a^'^\ . . . , and indeed differs from the hypothesis H^, 
only in imposing the relationship = jIq^^ = ... = = \Iq on the 

marginal expected scores of individuals on subtest . This additional 
restriction suggests estimating \Iq by the pooled estimator defined 
in (5'T)j thus freeing the residual cross-product matrix A^q to provide 
additional information about Z^^ . Since under H^^ we estimate Z^^ , 
P , and Z^^.o by W\q , Wq^Wq^ , and (N)'^^.^ = 
[W^^ - ^10^00^01"' ' ^^spectively, the arguments used in Subsection 3.2 of 
this paper lead us to think of estimating p and 2; q 

by w-Jw, 



and 



q , respectively, and to think of estimating Z^.^ by using 
the pooled estimator (N)''"^[W + A^^J . Finally, since H^, 



imposes 



(l^ (?) (k) 

no relationships among a^^^a^ . . . ^ beyond those Liposed by H^^ , 
,n estimate a^-^\a^^\ . . . ^a^^^ by the estimators used to estimate 



we can 



1 ' • IP - ' ^ ■■ ' " — . « IV .,^,.^-..1^.*.. « ^ — - " ""^ ^ ^ 
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these parameters under H^^ ; namely, x^"*"^ - ^o''^^^00^01^ " ^0 ^^00^01^ 
and x^^^ - J^o^^Wq^Wq^ . In the Appendix, it is shown that these 






estimators: 


















^ii.o^"'-^) = 1 (^11 - VooV ' 






are indeed MLE of the corresponding parameters. To obtain MLE of , 






we can substitute O-lh) into (5.2), to.king account of the equalities (5.15), 






and obtain 






(i^^^m'vc) = (aQ(m'vc),ij[^\m'vc)) 












(5.15) . . 

/ ^00 (^'oo ^oo)^o>oi ^ 

£(in'vc) =1 1 L ' 

Ko^'w^^^oo •• -00^ ""ii-o " ^lo^^oo^^oo - ^oo>Woi / 






as the MLE of . S^\\S^\ . . . ,\S^^ , and L respectively. Values of the 






MLE (5.1*^) and (5.15) for the example described in Subsection 5.0 are 






summarized in Table 6. 






i 
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Table 6. Maximum Likelihood Estimators under H , ^ for the 

m vc 

Illustrative Example 

jJ^^V'vc) = (51^.51, 2I1.12, 15.15) , 

iJ^^V'vc) = (51^.51, 2k. 12, Ik.ll) , 

ja^^^m'vc) = (51^.51, 2h.l2, I5.61) , 



/li4-52.98 971.72 629.14-6 
Z(m'vc) = I 971-72 722.63 k21.^0 

629.14-6 14-27.50 289.11 



f?Q(m'vc) = (51^.51, 2I4.12) , 
a^-'-^m'vc) = 0.1122 , a^^^(m'vc) = -0.2712 , a^^^(m'vc) = O.5675 
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5.5 Maximum Likelihood Estimators under H 

mvc 

Under the model defined by H^^^ the ^ests (T^^T^); (TQ^Tg); . . 
(T^^T ) when applied to the k randomly chosen groups produce statistically 
equivalent scores. That is^ the scores (^oi^^^li^^ ^"^"^ individuals 

in all groups are identically distributed with identical expected score 
vectors |i = ^^O^^l^ identical covariance m&trices 



E = 



/e E * 
' ^00 01 



^10 ^11 



The model defined by H^^^ thus requires the parameters of the distri- 
butions of scores to belong to the parametric subset co ^ where 

^ ^ mvc 

^vc ^ ^^^^ ^^^^^ = ... ji^^^ H ji is an arbitrary 

1 X p vector^ 

2^(1) ^ 2^(2) ^ ^ ^(^) « 5^ is an arbitrary p x p 
covariance matrix) 

The MLE of fit and E uader H are well known [see Anderson (195S)J. 

mvc 

These estimators are 



T 
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1 k / \ 
fi(mvc) = i Z N x^^'' = X , 



k k 

(5.16) £(nivc) = ^ [ Z V^S^ + Z N (x^s) _ 5).(i;(g) / ^)] 

= I [w + a] 

The MLE of the parameters 

(5-17) _i _ _i 

^00 ' ^ " ^00^01 ^ ^11-0 " ^11 " ^lO^OO^'Ol ' 

are obtained from (5«l6) through an obvious substituticn. The values of 
the MLE (5«l6) and the MLE (5«1T) for the example described in 
Subsection 5.O are summarized in Table T. 

3*6 Some Comments 

&oing back over the lists of MLE under the various hypotheses^ certain 
general rules can be observed to be at work. The assumption of the 
equality of the marginal covariance matrices ^qo^^^OO^^ * * *^^00^ ^^^^ 
affect estimation of the mean vectors ^^^^ ^^^"^^ ^ . . . y^^^^ (vhen compared 
to MLE for the mean vectors when equality of the 2qq is not assumed), 
but allows adjustment of our estimates of Zq|^ and 2^|^ through a 
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Table 7. Maximum Likelihood Estimators under H_ for the 

rave 



Illustrative Example 



fi(ravc) = (51^.51, 21^.12, 15.18) , 



f (mvc.) = 



/ll4-52.98 
971.72 
y 629. k6 



972.72 
722. 60 
14-27.50 



629. k6' 

14-27.50 

289.11 



aQ(ravo) = (5^^.51, 2I4.12) o:(ravc) = O.159O 



X _ /li^52.98 97l.72\ _/0.5750\ - r^.) - 15 

^"1971.72 722.60] ' ^("^''^ =-0.0870/ ' ^11.0^™^^'' - 



-5U- 

regression of the usual estimators and (H ) on tne 

^ g; 01 ^ 11 

residual (Ng)"^^^^ (N)""^ Z V^^^ around the pooled estimator 

(N)~^ Z V^§) of the common value E^^ of ^O^^^OO^^ ' " '^O^ ^ 

g=l 

g = 1^2^ •••^k (see Subsection 5.1). 

The assumption of the equality of the marginal expected score vectors 
'^r)^\^^\ ""'^V^^^ } similarly permits us to adjust estimation of the 
usua]; estimtors 'a^^\x^^\ . . . ^ and x|^^ of [x^\\i^\..., and ^i^^^ 
by regressing these estimators on the residuals x^"^^ - ^o^^O^^ " ^o^***-' 
and ^0 0 ^^^^^ pooled estimtor x^ of the common value 

^o"^^^0^^^ * * *^^0^^ ' This assumption also frees the residual cross- 
product matrix A^^ to help provide additional information for estimating 

when it is known that ^ l!^ = ... = Z^^^ = E^^ . The effects 

of such adjustments on the resulting estimators are illustrated in Tables 
2 through 7* 

Although the adjusted estimators may provide superior accuracy in 
comparison to the unadjusted estimators^ the distributions of the adjusted 
estimators are usually more complicated than the distributions of the 
unadjusted estimators^ and do not promise to be directly amenable for the 
purpose of forming confidence regions for the various parameters. In 
such cases^ r.he indirect route of obtaining confidence regions for the 
parameters (5-1) is often more promising^ *nce the MLE of these parameters 
in many cases have tractable distributions • Since the basic distribution 
theory for those estimators which do have convenient distributions is known 
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Lsee Gleser and Olkin (I966), (I969), (1972a), Anderson (1958)J, and since 
our remaining distributional results appear to be too cumbersome for 
practical use, distribution theory for the MLE is not given in the present 
paper . 

Tests of Hypotheses 

In previous sections we have described 5 separate hypotheses H^^, , 

H , . , H , H , J and H . These hypotheses specify relations 
m*vc* ' vc ^ m'vc ^ mvc ^ ^ 

among the parameters of the distributions of test scores cn k psychological 

tests (^o^"^l^^ ^"^0^ "^2^^ * * *^ ^"^0^^^ * Section 3 we sumarized the maximum 
likelihood estimators (MLE) of the parameters under each of these hypotheses. 
We also indicated what fonn the MLE of the parameters took under the general^ 
all-inclusive hypothesis^ , in which the parameters ( [x^^\z^^^ ) 
of the g -th test score distribution are not necessarily functionally 
related to the parameters ( ji^^\z^^^ ) of any other test score distribu- 
tion, h ^ g . In the present section, we describe statistical tests of 
hypotheses v^hich, upon the basis of the given test score data^ allov/ us 
to decide vhich hypothesis of any pair of these hypotheses best describes 
the parameters of the test score distributions. 

h^l Likelihood Ratio Test Statistic 

Let H and 11 be any 2 of the 6 hypotheses: , H . , 
a D t ^ vc 

H , , f H * H , , and H . For example, H mey be the hypothesis 
mWc' ^ vc ^ m*vc ^ mvc - a ^ 

H and II may be the hypothesis H , , * Assume that hypothesis H 
mvc 1) ^-^ m^vtf a 

logically implies li^ . In this case, classical likelihood ratio test 
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theory suggests comparins to by means of the likelihood ratio 

test statistic 

max p(x^V) 
a^b max p(x/y; ^ 

where 00 is the subspace of the total parameter space 00. which cor- 
a Xt 

responds to hypothesis ^ and 0^ is the subspace of the parameter 
space which corresponds to hypothesis . Since implies ^ 00^ 

is included in ay ^ and thus max p(x^V) < max p(x^v) . Since c^firly 

\ ^ 

A , > 0 ^ it follows that 0 < A ^ < 1 . Values of A ^ clfse to 1 
a^b — — a^b — a^b 

favor hypothesis H ^ while values of A close to 0 favor hypothesis 

a a^ b 

• If we adopt the approach of Neyman and Pearson to hypothesis testing^ 

we call H the null hypothesis^ and reject H (not necessarily in 
a a 

favor of Hj^ ) if 

where A"^ is a certain critical constant obtained from the null distribution 

of A , (that is^ the distribution of A , when hypothesis H describes 
a^ D a^ D a 

the parameters ( ti^Z )). If we wish to test versus at a level of 

significance of 7 ^ 0 < 7 < 1 ^ then we choose A-^ to satisfy 

(^•5) P{A^^^ < A^} < 7 , all (^1,2) e . 

In very large samples (i*e.^ when NpNg^.^.^N^ are all large^ and 

of the same order of magnitude)^ it can be shown that the distribution of 

2 

-21og A^ ^ when 11^ is true is approximate "i-y a chi -square ( X ) 
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distribution vath f , degrees of freedom^ Here f is a certain 

a^b ^ a^b 

integer v;hich depends upon the hypotheses and ^ and upon q ^ 

2 

p ^ and k • Let X (f^y) be that constant v/hich is exceeded vith 

probability 7 by a random variable having a chi-square distribution v;ith 

f degrees of freedom. Then^ in large samples^ it follows froir the above 

discussion that the critical constant defined in Equations (^»2) and 

(k.^) is approximately equal to exp[-ix (f ^7)] . Hence^ \^ large sample 

a^ D 

a likelihood ratio test of versus ^ at a level of significance of 



approximately 7 ^ rejects H if 

a 

Since H and H, can be any 2 of the 6 hypotheses H H 

a 0 t ^ vc* 

H- , , , H , H , , and H , a total of (p) = Ip pairs of 
m vc vc ^ m vc ^ mvc ^ 

hypotheses can be compared by means of a statistical test of hypothesis • 
In Ik of these pairs of hypotheses^ one of the hypotheses to be compared 
logically implies the other^ so that the classical likelihood latio test 
theory described above can be applied to construct a test of these hypothe- 
ses. These 1^ pairs of hypotheses are listed in the first tv7o columns of 
Tftble 8, 

In one of the 15 possible pairs of hypotheses^ however^ neither ^f 

the two hypotheses logically implies the other* This pair of hypotheses^ 

H , , and H , cannot be compared using the classical likelihood 
m'vc' vc 

r?itio test theory sketched above. We can^ of course^ construct a likeli- 
hood ratio test statistic A , of the form (^•l)i but choice of a 

a^b 
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hypothesis to serve as the null hypothesis is arbitrary, X ^ is not 

necessarily bounded above by 1^ and the. asymptotic distribution of this 

test statistic under either H , , or H^^ is not necessarily the chi- 

square distribution.- For these reasons^ comparison of H^t^^t versus 

H by means of a statistical test of h^/pothesis v;ould require an entirely 
vc 

separate analysis and discussion. Since it is unlikely that a comparison 

of H . . with H would arise as an iiUDortant problem in psychological 
m vc vc 

testing contexts^ we omit discussion of a test of significance for these 
two hypotheses. 

For each of the Ik pairs of hypotheses for which the likelihood ratio 

test theory is applicable^ we can construct the likelihood ratio test 

statistic A , by making use of the various maxima of the likelihood 
a^ D 

p(x^V) described in the Appendix. For example, suppose chat we v.dsh to 

make a statistical test of H versus H , . Note that H logically 

vc vc vc 

implies H , , so that H = H and ii = H , in this comparison, 
vc a vc D vc 

From Equation (A. 58) of the Appendix (remembering that (^^j^) 
parameterization in terms of the quantities defined in Equation (5*l) are 
equivalent paramete/izations), we find that 



(I1.5) max p(^,V) = H(V)| ^^qP'^I i^^^n.Q 



VQ 



where H(v) is defined by Equation (A.21) in the Appendix. Similarly, 
from Equation (A. 25) of the Appendix, 



(U.6) max p(x,0 = K(V) I | V^l"^^ ^ | ^ V^f.^J . 

A* #^ V 
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Dividing (^•5) by {k.6), we obtain the likelihood ratio test statistic: 

max p(x^y) 

(l4,E)ea3 
■\ - ~ ~ vc 

vc,vc' max p(x, V) 



k ' N_ ^11.0 

= n 
g=i 



I I ^11.0 I 



\ 

Once we have calculated the likelihood ratio test statistic A , y 

then if the sample sizes N^N^^...^N are large and of the same order of 

magnitude^ we can test versus at an approximate level of 

significance 7 by means of the test which rejects H if {k.k) holds. 

a 

Use of {h.h) requires knowledge of the constant f . y pl^s access to 

a^ D 

tables of the chi-square distribution. The constant f , can be obtained 

^ a^b 

from the well-known asyinptotic theory of likelihood ratio tests. Values 

of f , for each of the ^ possible tests of hypotheses are listed in the 
a^ D o 

fourth column of Table 8. Thus, the constant f , " needed to apply 

^ vc^vc' 

the likelihood ratio test of H versus H , in Targe samples is 

vc vc' 

given by (see Table 8): 



f = (p " - i)(p q 1) 

vc^vc* 2 



Suppose that p = 5, <1 = 2. k = 5* Then f , = 6 . If we want 
if if X ^ } } vc^vc' 

to test H versus H , at level of significance y = 0.05» then in 
vc vc' f 

large samples we would reject hypothesis H^^ if 



-ko- 



< exp[ 4X^(6, .05)] = .0018 . 

In Table 8, we give the likelihood ratio test statistics A , for k 

of the ik possible compar'^ v^ons of hypotheses (namely, H versus H , , 

H versus H i H , versus H , and H , , versus H , ). For 
mvc vc m vc vc m'vc vc' 

the remaining 10 comparisons, we recommend a modification of the likelihood 

ratio test statistic along the lines first suggested by Bartlett (1957)* 

From the modified statistic L given in Table 8^ however^ the likeli- 

a, D 

hoOd ratio test statistic A . may easily be obtained by merely sub- 

a, D 

stituting N for m or n ^ g = 1^2^...^k ^ and N for m or n 

s s s 

in the formula for ^ . For example^ in Table 8^ we suggest the 



statistic 



k 

(k.&) L , = n 
vc.vc' 



V 

m 11.0 
I ^ ^1.0 I 



\ 



/ 



for testing H versus H , (here, m = N - q-1. g = li2....,k, 

^ vc vc^ ^ ' g g ^ B ^ ^ ^ ^ 

and m = £ m ). To obtain A , i we substitate N for m and 
g=l g vc,vc' ' g g 

N for m everywhere in (^.8); the result is the formula for A , 

' vc,vc' 

already obtained in (^•7)* 

k.2 Bartlett Modifications of the Likelihood Ratio Test 

Consider the likelihood ratio test statist.ic A ^ for testine 

vc^ t ^ 

H against general alternatives H. . When k = 2 and , 

vc u 1 ^ 

it is known that the test of hypothesis which rejects H^^ when 
A - J. < is a biased test [Das Gupta (I969)]. In the univariate 



-1^1- 

case ( p = 1 Bartlett (1957) suggests modifying the likelihood ratio 

test statistic for testing the equality of variances among k populations 

by replacing the sample sizes N by the degrees of freedom n of the 

estimators of the variances of the g -th population, g = 1,2, ...,k , 

everyi'/here these quantities (the N 's) appear in the formula for 

the likelihood ratio test statistic. Anderson (1958; p. 2k9) proposes a 

similar modification of the likelihood rati0:^.test statistic 

for testing the equality of the covariance matrices among k populations. 

When = = . . . = N, • the likelihood ratio test statistic \ . 

12 k L 

and the Bartlett -t-^Te modification L . (see Table 8) of this test 

^ vc,t; 

statistic are monotonic functions of one another, so that in this case 

7^ and L ^ yield equivalent tests. That is, if we construct 

vc,t vc,t 

a test of H versus of level of significance 7 which is based 

vc t 

on A , and a test of H versus H. of level of significance 7 
vc,t ^ vc t 

which is based on L^^ ^ (and which rejects H^^ for small values of 
L ^ ), then the test based on . rejects H if and only if the 

VC,t ^ vC,L , vv. 

test based on L^^ ^ rejects H^^ . 

When at least two N 's are unequal, however, the tests of H^^ versus 

H based on A ^ and L . , resp.ectively, are not the same. In 
t vc,t vc,t 

particular, the test based on ^ is biased [Das Gupta (1969). Sugiura 

and Nagao (1968)]. The difference between the tests is most pronounced 
for snail and moderate sample sizes. For large samples \^^^ and L^^^^ 
are approximar.ely equal *to one another, and a test which rejects H^^ 
when 



T 
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has level of significance approximately equal to y • 

Next consider the test of H versus H, based on the likelihood 

jnvc ^ 

ratio test statistic A + • It can be shown [see Anderson (1958; 
Chapter 10)] that 

mvc^t mvc^vc vc^t 

where A is the likelihood ratio test statistic for testing H 

mvc^ vc 

versus H . Anderson suggests that since a Bartlett modification of 
vc 

X , improves the properties of the test of H^^ versus H. ^ the 
vc^t * vc 1/ 

identical Bartlett modification of A^^^^ will improve the properties 

of the test of H versus H. . As far as the property of unbiased- 
mvc t 

ness of a test is concerned^ Anderson's conjecture is correct. That is^ 

whereas the test of H versus H. which rejects H for small 

mvc t 

va]ues of A ^ is a biased test when the sample sizes N.^Np^...^N, 
mvc^t 

are not all equal, the Bartlett modification ^mvc.t ^^^^^^ 

yieldf an unbiased test. The test statistics A^^^^ and yield 

equivalent tests when = = = \ ^ nearly e^ual for large 

sample sizes, regardless of whether the sample sizes N^^N^^ . . .^N^^ are 

equal or not. The Bartlett modification of the likelihood ratio test 

statistic A ^ thus has greatest effect upon the properties of the 
mvc, t 

resulting test of H versus H. for small or moderate sample sizes 
^ mvc t 

N.,No^...,N, , which are not all equal to one another. 



Bartlett modifications for the test statistics A ^, . and A t.-^t 4. 

vC f\j m ^ V 

are justified by the same arguments used above to justify Bartlett 

modifications of A . and A . • In all of these cases the Bartlett 

vc,t mvc^t 

modification L , of A is fonned by replacing the sample sizes N 
a^ D a^ D o 

by the degrees of freedom n = N - 1 of the appropriate estimator of 
the covariance matrix { (N ) as an estimator of Yr^^ in tho, case 



of A 



Vc,t 



and 



A ^ • and (N Y^^^ as an estimator of lif} in 



Note that 



the case of A . ^ and A , . ^ ) in the fomula for \ , 
vc*,t m*vc*,t 2.^0 

in computing and analyzing "K , . (or L , 4. ) and A t,^^t 4- (^^ 
^ ^ *' ^ vc.t vc,t mvc^T^ 

L • . . )• we can act as if only observations of scores x^f ^ of indi- 
m VC ,t 01 

viduals on subtest have been obtained* 

Let us next turn to A , . Note from (^•T) that 

vc,vc 



vc,vc 




k 
n 

g=l 



N ll'O 



1 2 v^^^ i 



N \ ^11.0' 



' N 11.0 



In 



It is known that V^^^q has a Wishart distribution with m^ = N^ - q - 1 
degrees of freedom and expected value E(vj|^Q) = «ig^[f*o ^ g = 1,2, .••,k 



The quantity 



/ 



(1^.10) 



= n 

g=i 



I i v^s) 

' N \l«0 

■ g 



\ 



N \ 11-0 

g=l 



.(g) 



has the form of a likelihood ratio test statistic for testing the equality 
of the residual covariance matrices ^i^f^Q ' ^ ~ 1,2, ...,k against 
general alternatives, [indeed, it can be shovm that is the likelihood 

ratio test statistic for testing the hypothesis ^^^^q ~ ^ll-O = '" ~ ^±±.0 
against the hypothesis .] Since 

W = W - W W" w 
11.0 11 10 00 01 



(1^.11) k , J k 

g=l g=l 



(B - "oo 01' 00 ^ OO^Ol' ' 



00"01' 



it can be seen that 



"2 = 



N ^, ^11-0 
' N 11-0 




somewhat resembles the likelihood ratio test statistic of MANOVA. [Actually, 
U2 is the likelihood ratio test statistic for testing H^^ a^inst the 

hypothesis that Z^^]q = sg^o = ^ = 4l^0 > = 4V = " ' 

= "^^"S arguments presented earlier which justified modifying 

\ , it would appear that the performance of U, as the basis of a test 
vc,t ' ^ 



-1^5- 

of equality of covariances would be improved if in the formula for 

(g) 

we everywhere replaced N by the degrees of freedom m^ of Yiiio > 

g = 1,2, ...,k , and replaced N E N by m = E m . Then, using the 

g=l S g=l & 

arguments used earlier to justify modifying A^^^ ^ , it seems appro- 
priate to make a similar modification in the formula for • 
That is, we modify , by replacing N by m , g = 1,2, .••,k , 

VC , VC g o 

and N by m everywhere in Equation (h.9). We call the resulting 
statistic. 



L 



vc,vc 



k 

n 

g=l 



/ 



m 11-0 
_S 



m ^ll'O 



■gin 



/ 



the Bartlett modification of ?\ , . Since 7\ . „i„^t = \ 



vc,vc 



m'vc.m" VC 



VC , VC ■ 



'A . . = X A ^ ^ I = ^ y and 

nr/c,m*vc' vc,vc* mvc,m*vc m[vc,vc' vc,vc wro,vo ' 

A , , = "h ,A , , it seems appropriate to make modifications 

m vc,vc vc,vc* m vc,vc ' 

of the likelihood ratio test statistics '^^^ty^ m'^""* ^ ^"tvo m'vc* ^ 

?s , • and 7\ , , similar to the modification which we have just 

m[vc,vc ' m vc,vc' 

made to 7\ , • These Bartlett modifications are exhibited in Table 8. 
vc,vc 



For every Bartlett modification L , of a likelihood ratio 

a,D 

Stic 7\ shown in Table 8, the following comments apply: 
a, D 

(i) A test of H versus E based on L , rejects H when 
^ ^ a D a,b a 



a,b ' 



where L* is determined from the distribution of ^ 

when H is true, 
a 
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(ii) When the sample sizes are equal L^^^ and A^^^^ are mono- 
tone functions of one another and hence lead to equivalent 
tests of versus H^^ • 

(iii) \flien the san5)le sizes are unequal, and are either small or 
moderate in size, the test of versus of level of 

significance 7 which rejects when "h^^^ < is not 

the same test as the test of level of significance 7 which 

rejects H when L , < . In certain cases, it is 
a a,b 

known that che former test is a biased test, while the latter 

test is unbiased. It is conjectured that the test based on 

A , is always a biased test, while the test based on 
a,b 

L , is always an unbiased test, 
a, b 

(iv) ^'Hien the san5)le sizes are large, 

approximately equal; further, the test which rejects 
when 

and the test which rejects H when 

a 

both have level of significance approximately equal to 7 ♦ 

For each of the l^i- pairs of hypotheses which have been covered by 

our discussion in this section. Table 8 lists the test statistic ( 7^^^^ 

or L ^ ) which is recommended for testing this pair of hypotheses, and 
a,b 



I 



the degrees of freedom f , of its asymptotic chi-square distribution 

a, b 

under the null hypothesis ( H ). Table 8 also provides cross-references 

a 

to other articles [or to Anderson* s (1958) textbook] in which some of these 
hypothesis testing problems are considered* 

k»3 Asymptotic Expansion for the Null Distribution 

Each of the likelihood ratio test statistics listed in Tab.le 8 is a 

ratio of products of powers of determinants of certain random Wishart- 

distributed matrices. The exact null distribution of each such test 

statistic can be shown to be the same as the distribution of a product 

of powers of certain independent beta variates* Thus [see Box (19^9)^ 

Anderson (1958; pp. 205-209)], when N^^Ng, ...,N^ are all moderately 

large (say, > 5(p^ + k^) , g = 1^2, . . .,k ), the null (^^Is^tive 

distribution function of -2 log T , , where T,=A, orT,=L, 

^ a,b a,b a,b a,b a,b 

depending on the hypotheses and to be compared, may be 

approximated as follows: 

P{-2log T^^, < rr= (i - ^,^,)P{x2(f^^^) < p^^.T) 

+ 0(N'5) , 



ERIC 
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2 

where X (f ) represents a random variable having a chi-square distribu- 
tion with f degrees of freedom, f , is the degrees of freedom of the 

a, D 

asymptotic null distribution of the likelihood ratio test statistic (given 
in column h of Table 8), and ^ ^ and p , are constants depending on 
N^;>^2^- --^^^j^ ^ P ^ ^ f k , and the hypotheses ( and H^^ ) being 
compared. Given a desired level of significance 7 , we may use 
to obtain the critical constant 1^ for the test of versus 
which rejects when 

To do this, we first find a number t Ay) which satisfies 

a , D 



Then 



(l^.l6) T* = exp 4 [ ] • 

^a,b 

Thus, an approximate test of significance of level y for H versus 

a 

rejects when (^.1^) holds, where is given by {h*l6), and 

T,=A^ orT^=L^ depending upon the hypotheses H and H, 
a,b a,b a,b a,b ^ ^ ^ a d 

which are to be compared. 

Table 9 gives formulas for obtaining values of p , and ^ • With 

a, D a, D 

few exceptions, explicit formulas for p , and (t> , in terms of the basic 

a, D a, D 
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dimensions q ^ p ^ k , N^^Ng^.-.^N^ * are very long and complex (this 

is particularly the case for ). Thus^ for the sake of conipactness, 

a^ D 

we have found it expedient to give explicit formulas for p and ^ in 
only a few cases; the other p's and (l)'s in Table 9 are then expressed as 
certain functions of these explicitly defined p's and 0's. The functions 
needed to achieve the above-mentioned compactif ication in Table 9 are de- 
fined as follows. 

Kf , p , + f, p, 
('+.17) r(a,b,c;.) = ^ , 

a^b b^c 

and 

2 



(I1.I8) ^(a,b,c;K) = j-j- U^p^^^D^ + 4,c%, 

r(a,b,c;K) 



c 



f , f . 



a,b b.c / ^ 



Motivation for use of the functions defined by (^i-.l?) and (^-18) can be 

found in Gleser and Olkin (1972b). ' Here^ we illustrate how to use these 

two functions, and Table 9^ to obtain p » » ^ and 0 ^t„^t when 
^ mvc , m vc mvc , m ' vc ' 

(as ii\ the illustrative example of Subsection 5*0) q = 2 ^ P = 5 ^ 

k = 5 , and N, = = N, = 100 . 

1^5 

Looking at Table 9 and Equations (^i-.l?) and (h.lQ)^ we see that to 

determine p , , and 0 . . ^ we need to first find the values 
mvc^m'vc' mvc,m'vc' ^ 



^mvc^m'vc ^ ^mvc^m'vc ^ "^mvc^m'vc ^ ^m'vc^m'vc' ^ ^m'vc^m'vc' 

and 0 , , f • Explicit formulas for these quantities (in terms < 
m*vc,m'vc 

q ^ k , r ,...^N ) are given in Tables 8 and 9- From Table 8^ 
X ^ k 
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ERiC 



^ I = (p - q)(k - 1) = 2 . 



f . . = (P - q)(p + q + l)(k - 1) ^ g 



tn vc,m vc 



From Table 9, 



p 
tnvc 



2N - p - q - k - 2 390 Q 
,tn'vc 2N " 600 - '^"^^^ ^ 



P, 



, . (p-ci )(k-l)[( p-q )^Mk-l)^-? ] ^ 0 

12(2N - p - q - k - 1)2 ■ 

= 2 1 1 u q(p - q)^ + 3(p - q) - 1 1 q(p - gk - i) 

m'vc^m'vc' _i ™ ™ ^ 6(k -l)(p + q + l}(p - q) (p + q + l)m 

= .99551 . 



and 



- 6(k - l)(p 4. q + 1)(1 - p , , 

^ ^{^ - 1) [5(p 1 - kq)2 (p . qf (k - i)2q2 . 
tn 

1875 ^ ^00002 . 



(12) (2621)2 
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Thus, from Table 9 and (h.Yj), 



i^t'l - 5(P - q) - 1] 

^mvc^m'vc' " 6(p + q + 5)(k - l) 



^ (q H- l)(p H- q ^ qk - k) 
(p + q + 5)m 



= 1.00020 



From Table 9 and (h.l&)y 



f f 

mvc^.m'vc m'vc^in'vc' . N 



,,8(i.ool,l,l,)2 = It ^^^98500)2(0)2 . (1.000T6)2(. 00002) 



" (Dfe} f It (-93500) - 1V60076]2} 



= .00000? 
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in the above exantple, P^c,m'vc ' "m'vc^m'vc' * ^""^ W,m'vc 
were all close to 1, and «I'^c,m'vc ' Vvc,m'vc' ' Vvc,m'vc' ^'-'^ 

all veiy close to 0. This result is not exceptional. For every test of 
hypothesis represented in Table 9, it can De shown that p^^^^ 1 and 

<)) ^ ->0 as N,,N^,...,N. all tend to <» . (Olris fact follows since the 
a,b L d k 

limiting null distribution^ -21og T is a chi-square distribution 

with f ^. degrees of freedom.) In general, (D , is closer to 0 in 
a,b " 

large samples than p is close to 1. For example, in the case con- 

sidered above, the three <j) -values were all 0 to four decimal places, 

while the p -values were .98555, •99551> and 1.00020, respectively. When 

d) is very close to 0, but p , is not so close to 1 that we can set 
a,b ' a,D 

p = 1 without loss of accuracy, t (7) may be found by setting 

a, D 

t (y) = X(f ^,7) , as can be seen by setting = ^ (^-15), 

a,b ' a,b' ' ' 

and T* maybe found from (k.lS). That is, when <s>^^^ is very close 
to 0, 

T* = exp -i[ ^ ] . 

Pa,b 

Of course, when N,^,N2, . . .,Nj^ are so large that p^^^ = 1 and o^^^^ = 0 
(to several decimal place accuracy), then T» may be found from the 



formula, 



= e:!p '^i^^y7) 
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5. An Application - 

In previous sections, we have described the implications of the 
hypotheses H^^, , H^,^^, , H^^ , H^,^^ . and H^^ for the psychological 
testing situation in which k tests (T^.T^), (T^^T^), • • • . (T^^T^^) are given 
to k separate groups of individuals • If the k groups of individuals 
taking the k tests can be regarded as k random samples of individuals 
from a certain population of individuals^ and if the environments in which 
the k tests are given are homogeneous, then we would expect the distribu- 
tions of the scores of individuals on the k forms (T^^T^)^ (^q^ ^2^^ ' ' 
(Tq,Tj^) to have parameters related by hypothesis H^^^ if the k forms 
are parallel, and by hypothesis H^,^^, whether the k forms are parallel 
or not- 

If the assignment of individuals to forms^ or individual-fonn pairs 
to testing environments, has not been performed in such a way that dif- 
ferences among the parameters of the k test score distributions can be 
attributed solely to differences in the forms (Tq,T^), (T^^Tg), • • • ^ (Tq,T^) , 
then any of the hypotheses H^^, , H^^ , or H^,^^ may relate the param- 
eters, or there may be no relationships among the parameters ( )• 

In this psychological testing context, an experimenter who believes 
that his experimental design has made adequate allowance for individual 
differences and environmental effects upon testing performance would usually 
start testing hypotheses by comparing the hypothesis (parallel forms) 

to the hypothesis H^,^^, . In this section, we illustrate the test of 
these two hypotheses in the context of the example described in Subsection 
5-0- There, 5 forms (T^^T^) , > ^V^J^ ^^^^ ^^^'^ ^^^^^ """^^ 
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individuals. The subtest common to all 5 forms has 2 parts^ while 

the fonns as a whole each^have 5 parts. Hence^ k = 5; Q. = 2^ P = 5/ 

= = = 100 . 

12 5 

From T^ble 8, the recoiranended test statistic for testing H versus 
^ mvc 

H , , is- 
m'vc' 



(5-1) 



mvc^m'vc* 



5 /lr^i??oN|(Woo^Aoo)i 

n -s ^ 

g=i V (w + A) I 



2m 



where in^ = m2 = Tn^ = 97^ m=m^ + m2 + Tn^ = 291 . Because m^ = mg = xn^ ^ 
and 



wnere 



we can rewrite L , , in the fom 
mvc^m'vc 



(5.2) 



nrvc^m'vc' 



_\ '^ii-o' 



i(97) 



From the data given in Table 1, we find that 



^11 ?0 = 1981.95688 , = i5iU).iK)29ii , v^^Jq = 1189.6725^^ , 
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and that 



Note that I = , g = 1, 2, 5 , i| I = \ Q^.q since 

these quantities are scalars. Thus^ from (5-2) 

(5.5) L , , = .00998 

In Subsection i|-.5^ we indicated that a test of H^^^ versi^^ H^iy^i 
of level of significance approximtely equal to 7 rejects H^^^ when 

t , ,(7) 

= exp ^ L * J y 

^Twc^m'vc' 

and t , , is obtained from (i|-.15)- Since 
mvc^m'vc' 

f . , = 8 , p , , = l.OOi^ifi^ , ^ , , = 0.000002 
mvc^m'vc' ^ ^mvc/m'vc' ^ ^inYc^m'vc' 

we see from (^•15) that 

t , .(7) = X^(8,7) . % 
mvc,m' vc' ^ \ ^ / / 



and that 



= exp 4 [ ] 
^ - ^^V 2 ^ 1.00401 ^ 



If we wish to test H versus H . at level of significance 7 = 0.0 5 

mvc m vc 



then 



X^(8, 0.05) = 15. 507 . 
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and 

Since L , , is greater than • we cannot reject H at the 0.05 
mvc^m vc 

level of significance. Thus^ the three forms of the SAT can (tentatively) 
be regarded as parallel forms of the same test. 
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Appendix. Derivation of the Maximum Likelihood Estimators 

In Section "d -.w noted that the joint density function p(:<,V) 
of the sufficient statistic (x,V) has the form 

p(x,v)= n p(i(^))p:v(^^) , 
g=i 

where p(i^^^) is given b^Eqiuition (2.6), and p(V^^-') is given by 
Equation (2.7), g = 1,2, ...,k . Adopting the approach to the derivation 
of the maximum likelihood estimators which was mentioned in Section 5, 

we break the likelihood p(x,V) into two factors: (i) the marginal 

(1) .(2) -(k)x , „ /v(l) v^^) 
density function of x^ = (x^ ^x^ ^'''^^o ' 00 " ^ 00 ^^00 

V^^^) ; and (ii) the conditional density function p(x, V Ix^, V^^) of 
00 

(x,V) given (^q.V^^) . 

From (2.6); (2-7) and Theorems 2A.5 and T-5-5 of Anderson (1958), 

the marginal density function "^h^Q^^Q^^ i*"^ 



6=1 

(A.i) =c, n {|v(,^)| ^ ' 



where is a certain constant. Since the conditional density function 

p(J,vi5(.,VQQ) of (5,V) given (^q,\q) is equal to vM/v(^^,^Qr)) > 
it can be shown that 



(A.2) 
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1 " ^'\o ' '\i.o' '^ii-o' 

where is a certain constant^ the parameters a^^^ ^ p^^^ ^ and ^^^^^ , 

g = 1^2^ •••^k ^ are defined by Equation (5'l)^ and the sample quantities 
B^^^ and V^^^^ ^ g = 1^2^...,k^ are defined by Equation (5-5)- 

The joint density function p(x^V) of the sufficient statistic (x^V) 
is tli^^roduct 

(A.5) '^'^P^^) = p(^.v|xq,V^q)p(x^,V^q) 

of (A.I) and (A.2). In Section 5^ we have shown that the parameterization 

of p(x,v) in term^s of n^^^ , Z^g^ , a^^^ , P^^^ , and Z^^J^ , 

g = 1^2^... is equivalent to the original parameterization of p(x^V) 

in terms of ^i^^^ and 2^^^ ^ g = 1^2^... ^k . In this Appendix^ we find 

maximum likelihood estimators (MLE) of the parameters \x^^^ ^ ^00^ ^ 

Q,(g) ^ p(g) ^ ^11^0 ^ ^ 1^2^... ^k ^ under each of the hypotheses 

II , , H . , , H , H , , and H . These MLE can then be 
vc^ m*vc vc ^ m*vc mvc 

transfomed (see Section 5) to obtain the MLE of the original parameters 
\x^^^ and 2^^^ , g = 1,2, . 

To obtain the MLE or ohe parameters under the various hy7;otheses, 
we make repeated use of the following lemmEis. 



-66- 



Leinina A.l. Let Z. be given s x t matrices and H. be given s x s 



nonnegative definite matrices, J = 1,2^... ,r . Assume that E H. is 
nonsingular. Then for all s x t matrices H , 

r r 
(A.U) E (Z - H)'H (Z - H) = E (Z - H)'H (Z - H) 
j=l ^ J J j=l 

(H-H)-( E H.)(S-H) , 

J=l ^ 

^ r 1 r _ 

>7here H=f E H.) E H.Z.. Hence for all s x t matrices and 

j=l 0=1 ^ ^ 

any t x t norlnegative definite matrix A , 



r r 
^.5) E tr H.(Z. - H)A(Z. - H)' > E tr H.(Z. - ^)A(Z, - H)' , 



squal^ty in (A.' 



with equality in (A. 5) if H 
Proof. Note that 



(Z - H)'H (Z -JE) = (Z - H)'H (Z - H) (Z - H)'H (H - H) 

J JJ/ J JJ cl J 

4- (H-H)'H.(Z -E)+ (g-H)'H (H-H) , , 

r ^ r 

and that E (Z - H)'H. = E H.(Z. - H) = 0 . From these two facts, 
J=l 3 3 J=l 3 ^ 

(k'k) follows. Since for all H (including the case when H = H 



r r 

E tr H.(Z. - H)A(Z. - H)' = tr E (Z . - H)'H.(Z. - H)A , 
3=1 J J=l ^ 

y\ r 

and since tr(H - H)*( E H.)(H - H)A > 0 ^ (A.5) follows directly from 

J=l ^ 

(A.1|). For future use^ note that 
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(A.7) 2 tr H.(Z. - i:)A(Z. -Hy=trA[ E Z'.H.Z. - H' ( E H.)H] 



0=1 ^ ^ . 0=1 0=1 



L eTiiina A*2 * Let U be a given t x t nonnegative definite matrix^ and let 
be a p<^oitive integer. Then for all t x t positive definite matrices^^^^ 



A 



(A.8) IaT^^^ exp[4tr a"^J] < | ^ u|"^^ e*^^^ , 

with equality holding in (a^) if A = (l/^U . 

Proof . This lemma is "a direct consequence of Lemma 5*2.2 of Anderson 
(1958). ^ 

A.l Maximum Likelihood Estimators under General Alternatives 

From Equation (A.I)^ 

(A.9) p(vV=y ^ l^oo^l'^"'"'"'^^^(lio^5oo) ^ 

g=l 

where - ^ • • , 5oO = (^J^^^OO^^ * ' ' 

4k. 



g=l 



(a. 10) 
and 
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g=l 



k 

2 
g=l 



Note that A(^^Q,5Qo) > 0 , with AC^q^S^q) = 0 "HTf (i^^^ - x^^^ , 
g = 1,2,..., k . Hence 

g=l 

with equality when (i^^^ = i^^^ , g = 1.2, -^-.k • Applying Latmna A.2 
to each term in the product on the right-hand side of (A. 12), we conclude 
that 



g=l g g--^ g 

with equality in (A. 15) holding if = 5^^^ , '^o^ = Mo^ ^ 

g = 1,2, ....k . 

Let us turn now to p(i,V li^, V^q) . Let 0! = ia^^\J^\...,a^^h > 

,-(,('\,^'\...J^h . ond 5n.o = (4i?o^4!?o--^4'i?o) • ^^^-^ 



1 



D(g,g,5^^.Q) 
(A. 11^) 

g=l ^ 

.^nd 



1 
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(A.15) EO^E^^.q) = Z tr[v[^?o(B(s^ - p(^))(e[^|o)-^(b(s) - P^^^)'] y. 
Then from Equation (A«2), 

k / \ -|(p-<l) / \ |(n -p-l) 

(A.16) v(iM^Q,\o^ = n (Iv^g^l Ivif^ol} ^ h(a,p,E^^.Q) 



Where 



(A.17) ^ 1,^ 

It is clear by inspection of (A.l*^) and (A. 15) that H<^>^}^ii.q) > 0 
and E(g,E^j^^Q) > 0 for all q , p , E^^^^ ; and that B(a,^,L^^^Q) = 0 

and m,l^^.o) = 0 if = ^^^^ = " ^0^^^^^^ ^""^ ^^^^ = ^^^^ ' 

g = 1^2^... . Further^ from Lemma A. 2^ 

e^[4tr(E(s) yV^) ] < li-v^s) -i(p-<l)Ng 
l^ll-O' 2^^^^ii.O'' ^ii-o-' - 'n ^ll-O' ' 

g 

with (^quality if e[||q = (Ng)"'Sf[f?o ' ^ " 1,2, ...,k . Hence, we con- 
clude from (a. IT) that for all a , g , Sn-O ^ 

(A.18) h(a,e,E^^.„)<a-(*'-«"' S If v'fjj"*''^ , 

g=l g. 

with equality holding in (A.18) if a^^^ = a^^^ , p^^^ = B^^^ , and 
4??0 = (V"M??0 ^ g=l,2,...,k. 
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Then 0 = (^q^^^) ' largest possible space of possible values for 



0 is co^ ^ where 



and 



% t " " ^^5oO^ • ^0^^ ^ arbitrary 1 x q vector and ^ 

Eqq^ an arbitrary q x q positive definite matrix, 
r g = 1,2,... , 

m , = {0^ = {a.,^,Y.^. -) : a^^^ an arbitrary 1 x (P - <l) vector, 

" ' P^^^ an arbitrary q x (p - q) vector^ and S^^^^ 

an arbitrary (p - q) x - q) positive definite 

matri}^ g = 1^2^ 

Let H be .a hypothesis which restricts the paratneters G = 

a ~ 

f3,E^n r.) to ^ subspace co of o). of the form 



0) 

a 



where a>^ is a subset of co^ ^ and co^ is a subset of co. . . It 
O^a O^t l^a l^t 

then follows from (A. 5), (A.9)^ and (A.l6) that 
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^ ■ 



k . V -|(n 

sup p(i,v) = [C C H Iv^^^'l ^ ^ ][ sup f(n ,E )3 
(A. 19) 

W C sup h(a,p,E q)] 

In particular, it follows from (A. 15) and (A.l8), and from (A. 19), 

that 



(A.20) max p(5,v) = H(V) H |~V^s;| g ^ 



©QO, g=l 



where 



(A.21) H(V) = n C^e-^^ S | vCs)]" ^ V" "'^^ . 
" ^ g=l 

The maximum in (A.20) is .achieved when n^^^^ = x^^^^ , = (N^) , 

a(s> = a^s) , = B^^^ , and E^^ = (Ng)"M!!o ' § = 1.2,. ..,k , 

since (as shown above) equality in (A. 15) and (A.l8) is achieved for these 
values of the parameters. Thus, the MLE of the parameters under general 
alternatives are: 

^^0 ~ 0 ^ ^00 N ^00 ' ' 

g 

?(g) _ i_ M 

^11.0 ll'O ' 

for g = 1,2, •••,k , and the maxitnutn of the likelihood is given by (A.20). 
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A.2 Ma^^imum Likelihood Estimations under H^^, 

The hypothesis H^^, restricts the parameters 9 = (^^q ^^00^^^ ^^^11*0^ 
to the parametric subspace o) » described in Subsection 5*1* Note that 

(A.22) 03^^. = {9: 9^ e c^^^^,, 9^ e a^^^) , 

where 

Let Zqq represent the common value o"^ ^oo^^^OO^^ * * *^^00^ under H^^, . 
Note that when S^J^ = S^^) _ . _ = , 

(A. 25) ^(iio'loO^ = t^^l^^A^lio'EoO^^^'^Ool"^^ ^00^00 ' 

where W^^ = V^^^ . Since ACjiQ^goo) > 0 , with kin^^T.^^) = 0 if 
[1^^^ = x^^^ , g = 1,2, ...,k , and by an application of Lemma A. 2, we 
conclude that when S^J^ = ^00^= •'• = ^00^ = ^OO ' 



for all \Iq , , with equality achieved in (A.2l|) if = y^^^ , 

g = 1,2, ...,k and Z^^ =.(N)"-S7qq . 

We conclude from (A.22), (A.19), (A.18), and (A.2l|) that 

(A.25) max p(x,V) = K(V)||W |-^^ I "^ilW^^ ' ^ 

9ea3^^, g=l g 
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As shown above, equality is achieved in (A.I8) and (A-2k) when 

..(g) _ -(g) Js) _ -(g) M _ Jg) M) _ rV^^ 

^0 ^0 ^ " ~ ^ ^ ^ " ' 11.0 " ^ g-* ^11.0 ' 

and when Z^^J^ = Z^^^ = ... = = (N)""H/qq . Hence, it follows that 

under H^^, the MLE of p.^ , Z^J^ = zg^ = ... = = Z^^ , a , p , 

and ^ are given by Equation (5«5)» The maximum of the likelihood 

*wlx*U 

under H^^, is given by Equation (A.25)« 

A. 5 Maximum Likelihood Estimators under H^,^^, 



The hypothesis H^,^^, restricts the parameters 0 = (*^o^5o0^2^i?^5ll'0^ 

to the parametric subspace co , , described in Subsection 5«2. Note that 

m'vc' 

(A. 26) 0) , , = {0: 0^ € 01, , 0- € m. .) ^ 
^ ^ m'vc' 0 O^m'vc'^ 1 n.^t 



where 

r/ X / X (1) (2) (k) 

Let fi^ represent the common value of • • •yix^^ and Zqq 

represent the common value of ^qo^^^OO^^ * * *^00^ * ^^^^^ ^^^^ ^hen 

.(1) _ J2) _ _ Jk) _ (1) _ (2) _ _ (k) _ 

j^o - - • • • - - ^0 ^ ^00 - ^00 - • • * - ^00 " ^oo ^ 

(A.2T) tdto'^^QQ) = i^M-i A(^io,ZQo)]} IZqo exp -itr S'JWqq'^ , 



where in this case 

k 



g=l 



Applying Lemma A.l, ve find that for all (^^q^'^qq) ^ "b^m'vc' ' 

A(Uo,Eoo) > Z tr N (x(s^- ) (Z^ ) "^C x ^ - %)' 

g=l 

g-i 

k T k , X -, k _/ X 

where = ( Z N )"-^ Z N xj^'' = (N)""" Z N x^^^ . Thus for all 
g=l ^ g=l ^ g=l . 

(A.28) f (^io;Zoo) < IZqJ -^"^ exp 4tr(Zoo)-^Woo A^^) • 

An application of Lemnia A. 2 to the right-hand side of (A.28) yields 

(A.29) f((io^2oo)<l|(^00^ V^l'^''^'^'''' 

for all {\iQ,I.QQ) € '"o^m'vc' ' ^^^^ equality when = n^^^ = . . . = 

We conclude from (A. 20), (A.I9), Ta-IB), and (A.29) that 
(A.30) max P(^,V) = H(V)|| (Wqo Aqq)]-*^ ^ |^v[^?J . 

Since, as shown above, equality is achieved in (A.18) and (A.29) when 
,a) = ,(^) . ... . ) = , Z(J) = Zg) = ... = Z^S^ = (N)-(W,, . A,, 
C,(g) . Jg) ^ p(g) ... b(s) , zj^lo = (Ng)"M!?0 ' ^ = 1.2,... ,k , it 
follows that under H , , the MLE of = P-^^^ = ... = ^i^^^^^ = , 
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41^ --^l^ - -4o^=^00 ^ £u-0 are given by Equation (5.9). 

The tnaximum of the likelihood under H , , is given by Equation (A.JO). 

A.^ Maximum Likelihood Estimators under H^^ and H^t^^ 

The hypothesis H^^ restricts the parameters 9 to the parametric 
sul) space: 

(A.51) V = ®0 ^ %vo- ^ ^ "^^vc"^ > 

and the hypothesis H^,^^ restricts the parameters 0 to the parametric 
subspace 

(A. 52) 0) , = [9: e 01, , 0, € OL y 

^ ^ m*vc 0 O^m'vc' 1 ^^vc 

v/here 

ind Z^^.Q - 2^3^.0 - ... - L^^^Qi . 

Let P be the common value of 3^''"^ • • v 3^^^ f and let 2^^^^^^ 

be the common value of z[}^^,I.{^^^, - ,z{Vn • Note that when p^-""^ = 3^^^ = 

11.0' 11*0' ll'U 

(k) (1) _ (2) _ _ (k) _ 



(A.55) dKp^z^^^q) > 0 , 
with equality achieved if 

a(s) = x(g) - , g = i,2,...,k. 

Also, 
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Applying Lemma A-l (and Equation (A-?)) to {A.3h), we find that when 

(1) J2) M ( ^ ,,(g)rl y v(^.\ 

with equality achieved foi"-".,,^^ ' = ' = ... = P> = ( L V^^ ; S, Vqq I 

. Hence, for all ^ \ vc" ' from (A. 55) 

and (A. 55) that • ..^ 



(A.56) h(a,p,Z^^.o) < Iz^^.o r^^Cexp 4tr(4i!o)'Sl.O^ ' 

with equality achieved in (A.56) if a^^^ = x[^^ - ^^^^W^Jw^^ , g = 1,2,... 
and 3 = W"iw^, • Applying Lemma A. 2 to the right-hand side of (A.56), we 

UvJ 01 

conclude that for all (a, p,!^^^^) e u^^^^," j- 

with equality achieved in (A. 57) if '^-^^^ = ^[^^ " ^o^^^oO^Ol ' ^^^^ " 

,(^) = ... = pC^) = w„-\, , ana z^^l - zg), = ... = . 

We conclude from (A. 51), (A. 19), (A.2lv), and (A.57) that 
(A.58) max p(x,V) = H(v)||w.J^^|iw J-^^ . 



'N 00 ' 'N ll'O 
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Since equality is achieved in (A.2l^) and (A. 57) for- [x^^^ - , 
a(g) = ^(S) _ x^s)^-!^^^ ^ g ^ 1^2,... ,k , and for ^S^^ = = = 

- v(^) = (YlV\ , it follows that under H ^ the MLE are given 
" 11*0 11 -0 ^ 

by Equation (5-12). The maximum of the likelihood under H^^ is given 
by Equation (A. 58). 

Similarly, from (A.52), (A.19), (A.29), and (a.57). it follows that 

(A.59> tnax p(^,v) = H(v) i|(W^o + A^^) l^^^^ll W^^^^ 1"^^ • 

©€0) , 

m' VC 

The maximum likelihood estimators of the parameters under H^,^^ are 

given by Equation (5.1^), as can be seen from the sufficient conditions given 

above for equality to be achieved in (A. 29) and (A. 57)- The maximum of the 

likelihood under H , is given by Equation (A.59)- 

m vc 

A. 5 Maximum of the Likelihood under H 



mvc 



The hypothesis H restricts the parameters © to the parametric 
^ mvc 

sub space 



where 
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By applying Lemma A.l separately to D(a^ P^S^^^^q) and to '^(^^^h.q) 

when (a. p-S^T n) ^ ^ ^ shorn that 

\v ~ ~11»U' l^m vc 

g— X 

with equality achieved when a^-""^ = a^^^ = ... CC^^^ = \ ~ ^o^^OO ^00^ 

("oi ^ ^oi) ' ^^^^ = ^^^^ = ••• = ^^^^ = ^^00 ^ ^oo^'^^^oi ^^01> • 

Thus, when (^P^S^lq) ^ %„i"vc" ' 

(A.Ul) h(a,p,2^^,Q) < IZ^^.Q^^ exp 4tr(2ii.o)' Q^^.q 
where 

^11.0 = ^11 ^ hi - h0^%0 ^0^^%! - ^Ol) • 

Applying Lemma A. 2 to the right-hand side of (A.lll) yields .the result that 

(A.k2) h(a, < I I Q,,., 1-^^ e^^^"'^ ... 

for all (cc,^,Z^^,q) e. cd^ , with equality when a^"""^ = a^^^ = ... = 

From (A^O), (A. 19), (A. 29), and (A.1|2) it follows that 



(A.U5) ^ max p(x,V) = H(v) | | (W^q + Aqq) | | Q^^l-O ' ^ 

= H(V) I i (W -.A) . 



©€03 

rarvc 
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■The maximum likelihood estimators of the parameters under H^^^ are given 
implicitly in Section 5*5 (and explicitly above by the conditions for 
equality in (A.29) and (A.k2)). The maximuin of the likelihood under H^^^ 
is given by Equation (k^k^). 
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